Koonin et al. Biology Direct (2017) 12:31 
DOI 10.1186/s13062-017-0202-5 


Biology Direct 


OPINION Open Access 


Inevitability of the emergence and 


® CrossMark 


persistence of genetic parasites caused by 
evolutionary instability of parasite-free 


States 


Eugene V. Koonin' @, Yuri |. Wolf! and Mikhail |. Katsnelson? 


Abstract 


Genetic parasites, including viruses and mobile genetic elements, are ubiquitous among cellular life forms, and moreover, 
are the most abundant biological entities on earth that harbor the bulk of the genetic diversity. Here we examine simple 
thought experiments to demonstrate that both the emergence of parasites in simple replicator systems and their 
persistence in evolving life forms are inevitable because the putative parasite-free states are evolutionarily unstable. 
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Background 
Nearly all cellular life forms are hosts to various types of 
genetic parasites that exploit functional systems of the 
host cells to replicate their own genomes [1]. Only some 
bacteria with highly reduced genomes that themselves 
lead a symbiotic or parasitic lifestyle seem to lack genetic 
parasites that undoubtedly have been lost during the re- 
ductive evolution of these bacteria from free-living an- 
cestors. Genetic parasites include viruses, transposons, 
plasmids and other semi-autonomous genetic elements 
(SAGE) [2] that display a broad range of relationships with 
the hosts, from acute antagonism, whereby a virus rapidly 
kills the host, to symbiosis when SAGE are not costly to 
the host and could even have beneficial effects [3, 4]. 
Strikingly, virus particles appear to be the most com- 
mon biological entities on earth. In most environments, 
the ratio between virus particles and cells varies between 
10 and 100 [4—7]. This enormous physical abundance of 
viruses is matched by vast genetic diversity so that most 
of the gene repertoire of the biosphere appears to be 
concentrated in viruses, even as exact number remain a 
matter of debate [8-10]. The prevalence of viruses in 
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the biosphere is also paralleled by the abundance of SAGE 
integrated in genomes of cellular life forms. Integrated 
SAGE are present in virtually all genomes of cellular or- 
ganisms (again, missing only in some intracellular symbi- 
onts and parasites), and in genomes of multicellular 
eukaryotes, SAGE-derived sequences quantitatively dom- 
inate the genome, comprising at least 50% of the DNA in 
vertebrates and up to 90% in plants [11]. Recruitment of 
sequences from SAGE for cellular functions is a common 
phenomenon that made substantial contributions to the 
evolution of cellular life forms [12-14]. 

The entire course of the evolution of life is a history of 
host-parasite co-evolution [15-17]. Being subject to the 
constant onslaught of genetic parasites, cellular life 
forms have evolved a plethora of defense mechanisms. A 
typical organism harbors and interacts with multiple 
types of genetic parasites (e.g. viruses, different families 
of transposons, and plasmids) which it holds at bay 
thanks to multiple defense strategies that include para- 
site exclusion, innate immunity and adaptive immunity 
[18-23]. The SAGE respond with counter-defense mech- 
anisms that range from simple mutational escape from 
defense to dedicated multigene systems that specifically 
inactivate host defense systems. Notably, defense systems 
and SAGE including their counter-defense machineries 
are tightly linked in evolution. Enzymes involved in the 
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mobility of SAGE, in particular, transposons are often re- 
cruited by host defense systems for roles in parasite gen- 
ome inactivation and other functions, and conversely, 
SAGE recruit components of defense systems that then 
evolve to become agents of counter-defense [24—26]. 

Thus, the arms race, along with cooperation, between 
genetic parasites and their hosts are perennial features 
of the evolution of life. Why is this the case? Why do 
the parasites emerge in the first place? And, could some 
cellular organisms actually get rid of the parasites 
through highly efficient defense systems? Empirically, 
the answer to the latter question seems to be negative. 
Conceivably, the general cause of the inability of the 
hosts to eliminate the genetic parasites is the unescap- 
able cost of maintaining sufficiently powerful defense 
systems [27-30]. Analysis of theoretical models of para- 
site propagation suggests that an important source of 
this cost, perhaps the primary one in microbes, could be 
that efficient anti-parasite defense has the side effect of 
curtailing horizontal gene transfer (HGT), which is an 
essential process in microbial evolution that allows mi- 
crobes to avoid deterioration via Muller’s ratchet [31, 32]. 
Another major factor could be the effectively unavoidable 
autoimmunity [29, 33, 34]. However, what about the first, 
arguably, the most fundamental question: why do genetic 
parasites evolve to begin with? Again, empirically, there is 
a strong impression that the emergence of such parasites 
is inevitable. Not only are they ubiquitous in cellular life 
forms but they also evolve in various computer simula- 
tions of replicator system evolution [35-39]. Furthermore, 
it appears intuitive: genetic parasites can be considered 
cheaters, in game-theoretical terms, and as soon as, in a 
replicator system, there is a distributable resource, such as 
a replicase, cheaters would emerge to steal that resource 
without producing their share of it [40]. These, however, 
are informal considerations. Here we ask the question: is 
it possible to develop a theoretical framework that would 
allow a formal demonstration of the inevitability of the 
emergence of genetic parasites in evolving replicator 
systems, or else, that parasite-free replicator systems are 
after all possible? 


Evolutionary instability of parasite-protected replicators 

Let us try, as a Gedanken experiment, to construct a 
self-replicating entity that is strictly resistant to parasites. 
Consider a simple system consisting of a replicator, ser- 
ving as a template for itself, and the replicase it encodes 
(Fig. 1). The replicator is assumed to contain the 
replicase-encoding signal (RES) (the replicase could be a 
protein, a ribozyme, under the RNA World model, or, in 
theory, any other entity capable of catalyzing replication 
of a template) and the replicase recognition signal (RRS). 
Although we attempt to make this discussion as general 
as possible, it seems relevant to note here that, at least, 
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Fig. 1 A replicator-replicase systems with heredity, variability and 
differential reproduction. The dotted arrow denotes differential 
reproduction of the copies of the original replicator that carry mutations 


in extant replicator systems, the signals that ensure the 
specificity of replication are relatively simple and are 
much smaller than protein-coding genes [41, 42]. 

Evolvability is a fundamental and inescapable prop- 
erty of such a replicator-replicase system [43]. To be 
evolvable, a system must possess three basic properties: 
1) heredity (whereby the location of progeny in the 
phenotype space is correlated with that of the parents), 
2) variability (whereby the progeny is not identical to 
the parents), and 3) differential reproduction (whereby 
the capability of a replicator to leave progeny is part of 
the phenotype). Heredity is ensured by replication with 
fidelity above the error catastrophe threshold. The rep- 
licator theory that was developed primarily by Eigen 
and colleagues demonstrates that, under simple fitness 
landscapes, there exists a replication fidelity threshold, 
below which the master sequence in a population of 
replicators cannot be efficiently passed across genera- 
tions, so that the entire population collapses [44, 45]. 
Elucidation of the molecular mechanisms of primordial 
replication that could provide for crossing the error ca- 
tastrophe threshold remains a daunting task that is cen- 
tral to the entire origin of life field. However, for the 
purpose of the present discussion, we assume that a 
sustainable replicator system with a minimally accept- 
able fidelity has evolved. 

Variability is ensured because, at any temperature 
above 0 K, any process is subject to entropy-increasing 
fluctuations and, therefore, replication is inherently 
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error-prone, under the second and third laws of thermo- 
dynamics, and given a finite energy supply. 

Differential reproduction ensues from the fact that the 
replicator encodes the replicase that, in turn, copies the 
replicator itself. Mutations in both RES and RRS can 
affect the efficiency of replication. 

If the resources that are available to the system are 
limited (i.e. the system cannot support unlimited growth 
of all possible constituent parts), competition between 
individual replicators ensues and selection arises. In a 
system with finite memory storage, all information ex- 
change, transfer and utilization processes carry a mem- 
ory clearing cost of at least k71n2 J/bit, where k is 
Boltzmann constant and T is temperature. The existence 
and value of this minimum information cost is known as 
Landauer’s principle [46] which is a corollary of the sec- 
ond law of thermodynamics, under the information- 
theoretic interpretation of thermodynamics [47-49] In 
all known systems, this cost is many orders of magni- 
tude higher [46, 48, 49]. Therefore, selection for cost re- 
duction acts not only on the constituent parts of the 
system, but also on the information transfer processes 
themselves, effectively ensuring an upper limit on the fi- 
delity of information transmission. 

Selection acts on both RES (eliminating replicators en- 
coding inefficient replicases) and RRS (eliminating repli- 
cators that are inefficient as templates, e.g. are poor 
replicase-binders), but these two selection processes act 
on the replicator through physically different agents (the 
replicator-encoded replicase and the replicator itself, 
respectively). 

The dual nature of the replicator (acting as both the 
template and, directly or indirectly, as the replicase) ne- 
cessitates that the information embedded in the RES and 
RRS is realized via physically different processes. The 
RES guides the formation of the replicase which, in turn, 
recognizes the RRS. Such recognition implies comparing 
the RRS in the replicator with some form of memory 
encoded in or attached to the replicase (Fig. 2). 

A general, simple way of parasite emergence involves 
skipping part of the RES during replication, resulting in 
a shorter replicator that consists of the RRS and, in the 
extreme, nothing else (Fig. 3). This straightforward 
mechanism for parasite evolution is inspired by and is 
similar to the process of RNA shrinking that was ob- 
served during in vitro evolution in the classic early ex- 
periments of Spiegelman and colleagues [50-52]. 

Under the scheme in Fig. 3, parasites emerge as long 
as the information content of the RRS is less than that 
in the full replicator, i.e. when the RRS is at least par- 
tially separable from the RES. If this is the case, a repli- 
cator containing the full RRS, but omitting at least some 
of the RES (RRS, = RRS, RES, - >0; the subscript ‘p’ de- 
notes the respective signals in the parasite), would not 
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Fig. 2 The replicase-encoding signal (RES) and replicase-recognition 
signal (RRS) in replicator-replicase systems. For generality, the RRS is 
shown as being distributed along the length of the replicator although 
in real genomes, this signal is often localized such that, for example, 
short terminal sequences are sufficient for the replication of a virus 
genome. The replicase structure carries memory of the RRS allowing 
recognition of competent templates (“pass/block mechanism’) 





only serve as a template as efficient as the original replica- 
tor, but would also enjoy an evolutionary advantage be- 
cause replication of the smaller replicator is faster and 
requires less resources (building blocks, such as nucleo- 
tides, and energy). This makes the parasite-free equilib- 
rium point of the replicator-parasite system unstable 
because deletion of any part of the RES yields more effi- 
cient replicators (Fig. 4). Therefore, the system is vulner- 
able to parasite invasion, and moreover, such an invasion 
is inevitable under a non-zero parasite emergence rate 
(see Appendix for a more formal demonstration). 
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Fig. 3 Emergence of parasites in replicator systems via deletion of 
portion of the RES 
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Fig. 4 A conceptual phase diagram of the evolution of replicator systems 





It appears that, under this scheme, the only way to 
render the _ replicase-producing replicator parasite- 
protected is to make the RRS to include the entire RES 
(Figs. 2 and 4). Such RRS=RES configuration evidently 
rules out the emergence of a parasite because any muta- 
tion of the RES would also inactivate the RRS and pre- 
vent replication. 

However, such a parasite-protected state is subject to 
the aforementioned instability (Fig. 4). In the absence of 
parasites, perfect protection does not carry any benefits, 
but incurs a greater cost than less protected states. 
Given that the system is evolvable, an RRS < RES state 
will inevitably arise and outcompete the RRS = RES pro- 
genitors that are, as shown above, prone to emergence 
of genetic parasites (RRS, = RRS, RES, - > 0). 

From a more abstract perspective, the fully protected 
RRS = RES system corresponds to the maximally con- 
strained, i.e. minimum entropy, state. The second law of 
thermodynamics (once again, under the assumption of a 
limited energy supply) effectively guarantees that it 
evolves into a higher entropy state, such that RRS < RES, 
and at least some parts of the RES can be mutated or 
deleted without compromising replication. An additional 
important and highly plausible assumption predicating 
this conclusion is that the entire replicator system is 
stable on the time scale of RRS evolution, and accord- 
ingly, the evolving system can be considered to be at 
quasi-equilibrium. The ensemble of higher entropy 
states is obviously more robust than the unique RRS = 
RES state. In biological parlance, the higher entropy 
states are favored by selection via the ‘survival of the flat- 
test’ route [53]. They will necessarily prevail because 
there are plenty of such states with similar fitness values, 
whereas the RRS=RES state is singular. However, the 
problem with the ‘relaxed’ states of the replicator is that 
they are no longer protected from parasites because a 
parasite now can evolve that would exploit the RRS 
without producing the replicase (Fig. 3). The third law of 
thermodynamics dictates that the minimum (zero) 
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entropy state can be stable only at 0 K. Under the de- 
tailed correspondence between thermodynamics and 
population genetics [54], the equivalent of temperature 
is the inverse effective population size, and accordingly, 
O K corresponds to an infinite population, which can 
exist only as an abstraction. Thus, the two complemen- 
tary phenomena, i.e. microscopic replication errors, that 
are statistically inevitable at a non-zero physical 
temperature, and fixation of neutral or slightly deleteri- 
ous mutations in a population, which in its turn is statis- 
tically inevitable in a finite population, jointly ensure 
that a parasite-free replicator system is inherently un- 
stable and either goes extinct or rapidly spawns parasites 
(Fig. 4). The latter case inevitably triggers the host- 
parasite arms race, whereby in the simplest case, the 
hosts evolve by selection for changed RRS allowing them 
to escape the parasites, whereas the parasites catch up. 
Furthermore, the competition also occurs between the 
parasites themselves and could eventually result in the 
emergence of ultimate parasites, those that consist en- 
tirely of the RRS. Such (near) ultimate parasites were the 
end result of Spiegelman’s experiments [52] and also 
exist in nature, namely the viroids, small, non-coding 
parasitic RNAs that cause disease in plants and rely en- 
tirely on a host-derived replication machinery [55, 56]. 

It should be noted that, because, at least in the simplest 
replicator system, the replication rate is inversely propor- 
tional to the genome length, the parasites have an intrinsic 
advantage in the arms race. In the well-mixed case, the 
preferential replication of parasites drives the host to ex- 
tinction which, obviously, results in the collapse of the en- 
tire system (no replicase is produced anymore).However, 
spatial heterogeneity (compartmentalization) can stabilize 
host-parasite systems even in the absence of active 
defense. Thus, parasites drive evolution of biological com- 
plexity [37—39, 57]. 

There seems to be a symmetry between the hypothet- 
ical, minimum entropy, parasite-resistant replicator and 
the ultimate parasite with RRS, = RRS, RES, = 0, another 
minimum entropy state, this one being the theoretical 
end result of the competition between parasites (Fig. 3). 
As a minimum entropy state, the ultimate parasite can- 
not be evolutionarily stable either. The Gedanken ex- 
periment described above certainly is an idealization. 
Realistically, when the minimum entropy state relaxes, 
the host-parasite arms race takes more complex forms. 
Most parasites are far from this ultimate state but rather 
possess a number of genes and encode a variety of func- 
tions. This complexity of parasites has to do with two 
strategies that parasites evolve to maximize their evolu- 
tionary success, namely: 1) overcoming the defense sys- 
tems which the hosts evolve under the pressure for 
resistance to the parasites, and 2) surviving outside the 
host and disseminating among hosts [4]. The question, 
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then, emerges: even though the above analysis shows 
that evolution of parasites in simple replicator systems is 
inevitable, is there a chance that evolution of defense 
systems would exterminate parasites? 


Costs and compromises of anti-parasite defense 
The thought experiment described above also answers 
the question whether a perfect defense system can exist. 
A perfect self vs non-self discrimination, whereby a rep- 
licator possesses the means to reject or destroy any po- 
tential cheater, that is, any sequence other than a perfect 
copy of itself, is nothing but the same parasite-protected 
system with recognition based on the complete informa- 
tion on the self, ie. RRS = RES. We have already shown 
above that such a system is evolutionary unstable from 
pure thermodynamic considerations because it provides 
no benefit in the absence of parasites, and will inevitably 
devolve to an RRS < RES configuration (“leaky” defense). 
The existence of an unavoidable cost implies that 
maintenance of any form of defense is subject to a cost- 
benefit tradeoff. Notably, a recent quantitative assess- 
ment of the selection coefficients (a measure of fitness 
cost) associated with different classes of genes in micro- 
bial genomes has shown that defense systems are as 
costly as the more benign SAGEs, such as transposons 
[30]. This result seems to be a genome-scale reflection 
of the intrinsic costliness of defense dictated by thermo- 
dynamics. In the actual biological context, this cost can 
be manifested in different forms, such as autoimmunity 
or interference of defense systems with HGT. The cur- 
tailment of HGT, in particular, could be a key factor that 
makes defense systems costly and causes their repeated 
loss [32]. However, at the bottom of it all seems to be 
the thermodynamic cost of information. Certainly, 
defense is not the only process associated with an infor- 
mation cost. Such cost is intrinsic to all processes of infor- 
mation transmission including replication, transcription, 
translation, as well as signal transduction. However, loss of 
genes encoding those other functions is often strongly 
deleterious to the organism, resulting in positive mean se- 
lection coefficients for the respective classes of genes [30]. 
This is not the case for defense systems: the mean selec- 
tion coefficient values for the defense genes are negative 
[30], and in accord with this observation, defense genes 
are lost in the course of evolution significantly more often 
than genes of other functional categories [58]. Due to 
these fitness costs, evolving organisms cannot build up 
defense systems to the extent that is required to eliminate 
parasites (Fig. 4). 


A quasi-formal demonstration of the inevitability of the 
emergence and persistence of parasites 

In the above, we demonstrate that a parasite-protected 
state, one in which RRS=RES, is thermodynamically 
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unstable. The inevitability of the emergence of parasites 
and their subsequent persistence follows from this dem- 
onstration. To present the argument succinctly and 
quasi-formally: 


(1) The two signals that are essential in a replicator 
system, RRS and RES, are encoded in different 
genomic sequences. 

(2)Thus, at least some parts of the RES can be 
deleted without inactivating the RRS. Hence 
genetic parasites emerge. 

(3)The shorter the genome sequence of a genetic 
element the more efficient its replication is. 
Hence parasites accumulate in a replicator 
system and may bring it to collapse in a 
well-mixed case. 

(4) A perfect defense system should, in the least, 
be able to recognize parasitic elements, i.e. 
detect missing parts of the host genome. At 
the end of the replication cycle, this information 
should be deleted. Under the Landauer 
principle, the cost of memory cleaning is at 
least kTnln2 J (1 bits distinguish the host and 
the parasite). This cost makes defense systems 
evolutionarily unstable and precludes 
elimination of parasites. 


Conclusions 

The problem of the ubiquity and persistence of genomic 
parasites throughout the evolution of life can be broken 
into two parts: i) emergence of parasites in primitive 
replicator systems, ii) persistence of parasites in evolving 
organisms. Our analysis of the basic aspects of host- 
parasite coevolution presented here suggests that there 
are fundamental thermodynamic causes of the inevitabil- 
ity of parasites on both stages. To put it most succinctly, 
both the hypothetical parasite-protected state in a simple 
replicator system and the putative secondary parasite- 
free state resulting from efficient action of advanced 
defense systems are evolutionarily unstable and can only 
exist transiently. These conclusions certainly are com- 
patible with a wealth of observational data indicating 
that genetic parasites, along with defense systems, are 
enormously abundant in the biosphere and accompany 
virtually every cellular life form. The results of numerous 
mathematical and computational modeling studies on 
replicator evolution, in which parasites invariably appear, 
lead to the same conclusion. The simple thought experi- 
ments described here that start, effectively, from first 
principles emphasize the growing understanding that 
emergence as well as persistence of genetic parasites is 
an inalienable feature of evolving replicators and, as 
such, one of the central principles of biology. 
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Appendix a 

Consider a simple system in which both the replicator 
and a parasite comprise single entities and the repli- 
cator entity can also function as the replicase (e.g., both 
the replicator and the parasite are single RNA mole- 
cules, and the replicator molecule in the native folded 
state can function as the replicase ribozyme). Repli- 
cation of the replicator entity is governed by the 
second-order kinetics, where a template replicator 
meets an (identical) replicase to make a copy of the 
template. There is also a natural decay of both types of 
entities, occurring with the first-order kinetics. In an 
environment with a fixed carrying capacity (deter- 
mined, e.g., by the influx of consumable resources), 
such a system behaves as a classical logistic model, with 
one exception: there exists a critical number of entities, 
below which the population is unsustainable due to the 
(second-order) replication lagging behind the (first- 
order) decay. 

Likewise, a parasite entity replicates upon meeting a 
replicator (acting as a replicase) and decays spontan- 
eously. The parasite population is subject to similar en- 
vironmental restrictions as that of the replicator due to 
the limitation by the same resources. 

The general model of evolution employed here is derived 


from the classic logistic equation [59], an = rX(1-2) , 


where X is the population size, r is the growth rate and K 
is the population carrying capacity. Under the logistic 
model, the growth is limited by utilization of available re- 
sources influx by the growing population; the equilibrium 
is reached at X = K. 

For the purposes of this work, we modify the logistic 
model in the following way. First, we consider separately 
the populations of the bona fide replicators and the para- 
sites (R and P respectively). Then, we explicitly introduce 
the death process at the rate proportional to the popula- 
tion size (epR and epP) and the second order kinetics of 
replication, dependent on the interaction of the replicator 
and the template (R* for the replicator and RP for the 
parasite). The replicator and the parasite utilize the same 
resources (the latter being more economical by a factor of 
q), so both contribute to the numerator above the carrying 
capacity (R + P/q); also, the parasite replicates faster than 
the replicator by the same factor g (the simple conceptual 
model of this effect is based on an RNA molecule that is 
shorter than the replicator by a factor of q). Finally, the ex- 
istence of a (potentially costly) defense system, embedded 
in the replicase, is taken into account in the form of a co- 
efficient, dependent on the defense systems efficiency e”. 
The action of the system inhibits the growth of the para- 
site but incurs a cost to the replicator that is also 
dependent on e* with the cost coefficient a (according to 
the Landauer principle, a > 0): 
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(the replicator growth rate is taken to be 1 without loss 
of generality). 

Upon introduction of the parasite to the population 
of the replicator near the equilibrium (P—0, dR/dt 
— 0), the host defense prevents the parasite invasion 
(dP/dt <0) only if 


1+e* er | 

T+ae~ 44 
or, in other words, only if the effect of defense on the in- 
vading parasite relative to the cost of defense to the host 
is greater than the parasite advantage g (assuming com- 
parable decay rates). 

Obviously, the replicator population lacking the 
defense system (e*=0) cannot resist the parasite inva- 
sion. A defense system combining high efficiency with 
low cost is capable of protecting the host population, 
but the maintenance of such defense depends on regular 
invasion of parasites (otherwise, the evolutionary disad- 
vantage due to the cost of defense would drive the 
defense system to extinction). In other words, a parasite- 
protected state of replicator system is unstable. 


Reviewers’ reports 

Reviewer 1: Yitzhak Pilpel (Weizmann Institute of Science) 
This is a very elegant argument that indirectly explains 
the common observation that essentially all non- 
parasitic life forms are subject to parasites. The argu- 
ment is valid, the model is adequate and the paper is 
very well written. As such it will make a nice contribu- 
tion to the field, and publication is Biology Direct is 
certainly recommended. 

This paper, on the inevitability of parasites in bio- 
logical systems, provides a thermodynamics — based for- 
mal argument that parasitic (viral like) systems are 
bound to emerge and persist in any biological system. 
As ‘biological systems, the authors consider a simplest 
cases of a replicator that is encoded from a ‘genome’ 
which consists of a region that encodes for the replicase 
(RES), and a recognition site within that genome (RRS) 
that is to be recognized by the replicase to ensure repli- 
cation. A parasite can emerge as a variant of the replica- 
tor which omits at least part of the RES, thus saving 
replicase expenses, but that nonetheless preserves the 
RRS to a sufficient extent such that it may still get repli- 
cation services from the host replicase. A key to the 
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argument is the host’s defense system that, if abstracted 
appropriately as done here, can be thought of as means 
for a host replicator system, to discriminate against a 
parasite RRS’s in favor of the host’s, in a replication 
cycle, yet at the investment of costly resources. The 
question then becomes, how stable is a system that con- 
sists only of a host replicator without a parasite, e.g. in 
the face of ‘invasion’ of a parasite, or simply when host 
replicators, inevitably mutate into parasites. The argu- 
ment, presented as a thought experiment, and also as a 
mathematical formal model is simple: the hosts will sus- 
tain the costly defense system, provided that it is suffi- 
ciently discriminative in its favor, provided that the 
parasite emerges frequently enough. If the parasite is 
defeated the host will lose the defense system, and will 
hence become sensitive to a future attack. This means 
that the parasite free system was instable. This is a very 
elegant argument that indirectly explains the common 
observation that essentially all non-parasitic life forms 
are subject to parasites. The argument is valid, the 
model is adequate and the paper is very well written. As 
such it will make a nice contribution to the field, and 
publication is Biology Direct is certainly recommended. 

Response: We appreciate the constructive assessment of 
our work. 

I do have comments and suggestions though. 1. The 
potential protected state defense “RES = RRS”, is shown 
to be instable due to cost considerations. Yet some po- 
tential embodiments of the RES=RRS solution could 
have little or no costs. For example, in real-genomics 
terms, if the origin of replication of the host was embed- 
ded within the sequence encoding for the DNA replica- 
tion machinery, protection could be provided with no 
cost. Could the authors comment on this possibility? I’m 
wondering if real genomes feature such a solution. 

Response: Actually, as shown in Fig. 2, we explicitly 
consider the possibility that the RES and the RRS over- 
lap. A protected state emerges only when RES = RRS, 
and this state is inherently unstable. 

2. Can the model account for “nested parasitism”, i.e. a 
parasite of a parasite. For example consider the case of a 
SINE and a LINE, in which the SINE could limit the 
spread of a LINE — what are the stable solutions of a 
three-way system that consists of a host, primary parasite 
and a secondary one? 

Response: nested parasitism is indeed a very interesting 
phenomenon that has been explored using mathematical 
models as well as experimentally, in particular, for the 
virus-virophage-host systems. However, this is beyond the 
scope of the present analysis that deliberately focuses on 
the most basic host-parasite systems. 

3. The model (esp as stated explicitly in its math em- 
bodiment) assumes that the only way through which the 
parasite inhibits the replicator host is due to their 
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sharing the growth resources ((R + P/q)K), while in real- 
ity some parasites such as viruses can inflict more severe 
damage. I wonder if effects such as lytic infection should 
be modeled explicitly. 

Response: Once again, we consider the most funda- 
mental host-parasite systems in which the damage 
inflicted is minimal, i.e. indeed limited to resource appro- 
priation. In biological terms, this corresponds to the first 
replicators, supposedly, within the hypothetical primor- 
dial RNA world. The lytic viruses as well as symbiotic 
plasmids are products of subsequent evolution. The di- 
versification of parasite strategies is an important subject 
for theoretical study that we expect develop in the future 
but not in this paper. 

4. Regarding the mathematical model of Appendix A, 
a few comments. (i), it is not clear to me why the dis- 
crimination efficiency factor e* appears identically on 
the replication and parasite eq. I’d imagine it appearing 
only on the parasite, and depending on its exact defin- 
ition (which is missing), perhaps appearing as 1-e* in the 
replicator equation. (ii), I’d recommend for the non- 
math oriented reader to introduce first the basic logistic 
model (dx/dt = rx(1-x/k)-ex). (iii) the steady state stabil- 
ity analysis is not provided in a rigorous fashion, if the 
authors go through presenting the ODEs they could per- 
haps do the extra step of formally identifying all the 
steady state and analyze their stability. (iv) related — it 
was not clear to me if Fig. 4 represents such stability 
analysis, that is directly based on the equations, or is it 
hand drawn result of the thought experiment. 

Response: (i) Actually, the discrimination efficiency fac- 
tor e* does not appear identically in the equations for the 
replicator dynamics and the parasite dynamics: in the 
former, it appears with the cost coefficient a. Less for- 
mally, we chose a simple generic function which is re- 
quired to have the following properties: defense reduces 
the replications rates of both the host replicator and the 
parasite, and the magnitude of this effect is co-monotonic 
with e*; (ii) In the revised version of the Appendix, we fol- 
low this suggestion; (iii) we believe that the simple ana- 
lysis described in the paper is sufficient to demonstrate 
the instability of the parasite-protected state in the ana- 
lyzed toy model; complete analysis of phase portraits in 
more complex systems is beyond the scope of the paper; 
(iv) Fig. 4 is indeed a hand drawn results of the thought 
experiment (see point iii). 


Reviewer 2: Bojan Zagrovic, University of Vienna 

The authors provide a novel and potentially far-reaching, 
albeit largely qualitative argument concerning the thermo- 
dynamic necessity of genetic parasite evolution ie. in- 
stability of parasite-free replicator systems. Given the 
complexity of the problem at hand and the simplicity and 
the generality of the proposed framework aimed at 
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addressing it, there by necessity exists a degree of vague- 
ness in the abstract arguments presented, but this I do not 
hold against the authors. They are attacking a fundamen- 
tal question and their arguments involve only the funda- 
mental tools e.g. the second law of thermodynamics - this 
generality is in fact the main strength of the article, which 
makes it worthy of being published, studied and poten- 
tially extended. 

Response: we appreciate the constructive assessment of 
our work. 

While I found the article stimulating, my main concerns 
relate to the difficulty of understanding the thermo- 
dynamic aspects of the authors’ arguments. Specifically, 
the authors interchangeably refer to thermodynamics in a 
classical, physical sense and, at the same time, in the con- 
text of a previously demonstrated analogy with population 
genetics formalisms. For example, the term temperature is 
used both in its standard, physical sense and in a sense of 
population genetics, where zero temperature refers to 
populations of infinite size. Similarly, when they speak 
about, they refer to free energy cost of memory clearing in 
its canonical sense, while on the other hand it is not clear 
what this would mean in the population genetics sense. 
This dual usage adds to the difficulty of following and fully 
appreciating the main argument. It would be beneficial if 
the authors devoted more space to clarifying the distinc- 
tion between the two. 

Response: we believe that both the traditional thermo- 
dynamic perspective and its counterpart in population 
genetics are relevant and important. To connect the two, 
we added the following in the revised manuscript: “Thus, 
the two complementary phenomena, i.e. microscopic 
replication errors, that are statistically inevitable at a 
non-zero physical temperature, and fixation of neutral or 
slightly deleterious mutations in a population, which in 
its turn is statistically inevitable in a finite population, 
jointly ensure that a parasite-free replicator system is in- 
herently unstable and either goes extinct or rapidly 
spawns parasites (Fig. 4).” 

On the other hand, I have found the authors’ argu- 
ments concerning a “perfect” defense system in which 
RES = RRS to be very stimulating and potentially rele- 
vant in the context of another question of a wide 
biological significance, that is, the origin of the universal 
genetic code. Namely, among different hypotheses, the 
stereochemical hypothesis of the code’s origin has received 
significant attention over the past decades [60, 61]. Specif- 
ically, the hypothesis suggests that the code evolved from 
direct interaction preferences between amino acids and 
their cognate codons. Recently, we have generalized this 
hypothesis to suggest that if amino acids should interact 
specifically with their cognate codons, then oligopeptides 
should also interact with their cognate mRNAs in a com- 
plementary, co-aligned fashion, especially if unstructured 
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[62-64]. In fact, we could demonstrate that the 
nucleobase-density profiles of modern day mRNA coding 
sequences exhibit a close matching with the correspond- 
ing nucleobase affinity profiles of their cognate protein se- 
quences, giving support to the generalized version of the 
stereochemical hypothesis. Now, a putative primordial 
scenario in which such complementary interactions could 
have taken place is one in which a given mRNA encodes a 
corresponding protein, but also directly interacts with it 
(mRNA as a direct template for synthesis of a protein). If 
we envision that the protein in question is the ancient rep- 
licase and that the two interact along the whole sequence, 
then we have a situation in which RES= RRS. In other 
words, it appears that the generalized version of the 
stereochemical hypothesis may actually be consistent with 
a picture of a replicator with a perfect (or, at least, strong) 
defense system against parasitic genetic elements, as dis- 
cussed by Koonin et al. As the authors discuss, such a per- 
fect system may be unstable, but this still allows for a 
possibility that it serves as a useful asymptote towards 
which evolution might gravitate towards. What I then find 
particularly exciting is the possibility that such a drive to- 
wards a high-level (although perhaps not perfect) discrim- 
ination between hosts and parasites might have led to the 
development of the universal genetic code as we know it. 
In other words, it is possible that the code is as it is, in 
part because of an attempted optimization of high-level 
host-parasite discrimination. I wonder if the authors 
would care to comment on this possibility and perhaps 
develop it even further. To the best of my knowledge, I 
have never encountered any similar discussion linking the 
origin of the genetic code with the evolution of the host- 
parasite relationship. 

Response: this is definitely an interesting line of discus- 
sion but in the context of the current article, we address 
a more coarse-grained model to address the basic princi- 
ples of parasite emergence and evolution. 


1. “gedunken experiment” should be spelled “Gedanken 
experiment”. 


Response: we regret the error; corrected. 


Reviewer 3: Eric van Nimwegen, Biozentrum, Basel 
This is an interesting paper for people interested in the 
evolution of parasites. While the general evolutionary ar- 
guments are valid, the authors also suggest in various 
places that there is some more fundamental basis for 
their arguments rooted in thermodynamics and those ar- 
guments are really just wrong and based on a misunder- 
standing of the physics. 

Response: We appreciate both the general positive as- 
sessment and the criticism. Below, however, we argue that 
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the thermodynamic considerations are relevant and cor- 
rect, after appropriate modification. 

The authors present a number of arguments and 
gedanken (not ‘gedunken’ as was written in the manu- 
script) experiments for why the emergence of parasites 
is virtually guaranteed within systems of replicators. | 
generally agree with the authors that situations without 
parasites are very hard to stabilize from an evolutionary 
perspective, and the authors do a reasonably good job of 
giving some insightful arguments for why this is. This is 
especially useful to make clear that one doesn’t need 
special explanations for why arms races between replica- 
tors and their parasites persist in evolution. Those parts 
of the paper I liked and agree with. There’s two things 
that I didn’t agree with. First, the writing suggests that 
the authors believe their arguments almost amount to 
something like a mathematical proof that situations 
without parasites can never be evolutionarily stable. I 
don’t believe this is correct. I don’t believe that the argu- 
ments in the paper have really exhaustively covered any 
possible conceivable replicator system, i.e. I simply do 
not believe that we have as much imagination as nature 
might have in this respect. Moreover, even for the situa- 
tions discussed in the paper, I can think of some caveats 
that could threaten the argument (mentioned below). 
Even if I don’t think these are likely to occur in practice, 
they do affect the ‘mathematical rigor’ of the argument. 
This criticism can be easily addressed by toning down 
the language regarding how definitive the arguments are 
that are being presented. 

Response: Certainly, the model presented in the paper 
does not exhaust all conceivable replicator systems. There 
is no claim of such universality in the article. However, 
we do submit that the model represents the most basic 
replicator system readily imaginable. We explicitly state 
the assumptions under which the model is applicable 
(these assumptions have been clarified in the revision as 
indicated below). We do not claim a mathematical proof 
of inevitability of parasites (which is impossible by defin- 
ition) but we do believe we present strong arguments 
rooted in fundamental physical principles. 

The parts of the paper that I have most trouble 
with concern the attempts to make connections be- 
tween what are essentially arguments about evolution- 
ary dynamics, and notions from thermodynamics, 
including the second and third laws, and Landauer’s 
principle. I strongly feel that these thermodynamic 
considerations have nothing whatsoever to do with 
the topic of this paper and I believe the paper would 
be much better if all these references to thermo- 
dynamics were removed. I know that there is an 
enormous amount of confusion and misleading argu- 
ments in the literature around the topics of the sec- 
ond law, the arrow of time, the relation between 
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thermodynamics and computation, and the relation 
between thermodynamics and biological evolution. I 
get the impression that the authors have fallen prey 
to some of these misleading arguments. The issues 
are technically simple but conceptually full of pitfalls 
and are really beyond the scope of a small review like 
this. I will just make a few remarks and point to lit- 
erature that I find relevant. First, regarding the connec- 
tions between population genetics and thermodynamics. 
This is really nothing but the appearance of similar math- 
ematics in different subject areas. The reason that such 
similar mathematics appears is because the maximum- 
entropy methods of statistical physics are general methods 
of statistical inference, that will apply in many situations, 
including population genetics. I refer the authors to the 
work of E.T. Jaynes, who has explained this most clearly 
in my opinion (see, e.g., chapter 11 of ‘probability theory, 
the logic of science’). The relations between the second 
law, irreversibility, and computation all ultimately stem 
from the fact that the laws of physics are reversible at the 
microscopic level (note that we are here not concerned 
with quantum measurement, collapse of wave-functions, 
or many universes splitting, etcetera. That’s conceptually 
an even much bigger can of worms that I would prefer to 
keep closed here). Because of this microscopic reversibil- 
ity, Liouville’s theorem guarantees that any collection of 
states in microscopic state space retains its volume under 
time dynamics. One can think of a thermodynamic 
’‘macrostate’ as nothing but a particular collection of 
microscopic states and the entropy of this macrostate es- 
sentially corresponds to the logarithm of its volume. Since 
volume is conserved, it is impossible for a macrostate to 
evolve into another macrostate with LESS volume, i.e. en- 
tropy cannot go down. This is the essence of the second 
law. (See, for example, Jaynes, E. T., 1965, ’Gibbs vs Boltz- 
mann Entropies,; Am. J. Phys., 33, 391 which, in addition 
to many other conceptually enlightening papers can be 
downloaded from http://bayes.wustl.edu/etj/nodel.html). 
Similarly, whenever a subsystem undergoes some dy- 
namics that is irreversible in the sense that multiple 
input states map to the same output state, this corre- 
sponds to a contraction in the volume of microstates of 
this subsystem. Since the microscopic laws are time re- 
versible, the joint volume in state space for the subsys- 
tem plus its environment must be conserved, and thus 
a contraction in volume for the subsystem must be ac- 
companied by an expansion of the volume of the envir- 
onment. By the definition of entropy, this expansion 
corresponds to an increase in entropy. This is the rea- 
son why all systems that sustain irreversible behavior 
must be dissipating entropy (and, at finite temperature 
kT times as much energy) to their environment. Land- 
auer’s principle is just one example of this, ie. erasing 
memoty is irreversible. However, none of this has much 
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of anything to do with the evolution of replicators and 
their parasites. 

Response: as it could be suspected, the authors are fa- 
miliar with the work of Jaynes and related literature. There 
is no argument at all regarding the validity of these con- 
cepts and theory. What we fail to see, is why Landauer 
principle is irrelevant when it comes to the evolution of 
replicators and their parasites. It defines the low bound on 
the energy cost and hence constrains evolution. In the revi- 
sion, we specifically refer to the information-theoretic inter- 
pretation of thermodynamics and cite Jaynes’ book. 

The key point is that any replicating system is con- 
stantly performing irreversible operations (e.g. when it is 
copying itself) and dissipating energy in the process. Ar- 
guing against defense systems because they require en- 
ergy dissipation seems fundamentally misguided to me 
because energy dissipation is what replicating systems 
do by definition (ie. take energy sources from the envir- 
onment and turn it into copies of themselves). There’s 
no reason to believe that evolution would act to 
minimize energy dissipation in their functioning. In fact, 
it's probably more accurate to think of evolution as a 
competition between replicators for dissipating energy 
from the environment as fast as possible, i.e. faster repli- 
cating cells are dissipating energy from the environment 
faster. I realize this is a fundamental discussion that is 
much beyond the scope of this review but I have really 
no doubt that the viewpoint that the authors present is 
simply incorrect on the connection between evolution and 
physics, and I think it is really unhelpful to further add to 
the confusion on this topic. I feel the evolutionary argu- 
ments are valid without any appeal to thermodynamics 
and the paper would be much improved if thermodynam- 
ics (and Landauer’s principle) were not mentioned at all. 

Response: This is a very interesting comment where we 
seem to disagree with the reviewer on substance. We 
think there is every reason to believe that, in a finite re- 
source situation, selection does act to minimize energy 
dissipation by evolving replicators. Actually, the state- 
ment that replicators evolve to dissipate energy as fast as 
possible is outright false in its simplest form. Fast replica- 
tion leads to an increased error rate and eventually to 
collapse of any replicator systems. Hence the trade-off 
between replication rate and accuracy that can lead to 
different optimal solutions during evolution. Evolution- 
ary success of many fastidious organisms is broadly 
recognized, and in spatially organized environments 
slow-growing strains actually systematically outcompete 
fast-growing ones |65, 66]. 

Regarding ‘adding to the confusion, we certainly must 
avoid this. In the revision, we emphasize in several 
places that we address here energy costs under the lim- 
ited resources assumption not simple application of laws 
of thermodynamics under equilibrium. With these 
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qualifications, we believe that the physical considerations 
here are relevant and even essential. 

Some more detailed comments: page 5, lines 98-100: 
“replication is inherently error prone” I believe this is a 
misleading statement. As Hopfield’s kinetic proofreading 
principle has shown, one can use dissipation of energy 
to make copying as accurately as one wants. There's a 
whole field on error-correction mechanisms in computa- 
tion that show that, if one is willing to pay energy, there 
are no fundamental bounds on the accuracy that can be 
achieved. Page 5, line 110: “in all known systems, this 
cost is many orders of magnitude higher” I do not 
understand what ‘this cost’ is referring to here. 

Response: We appreciate the comment. Once again, 
the quoted statement required a crucial qualification 
that replication is error-prone given a finite energy re- 
source which, obviously, is the case in any realistic situ- 
ation. There is no doubt that dissipation of energy can be 
harnessed to achieve any desired accuracy of copying but 
the problem is the energy cost of such solutions. Such 
qualification was added in the revised manuscript. 

In any case, as I have argued above, I strongly feel 
Landauer’s principle is irrelevant for the problem of 
evolutionary stability of parasite-free states. Page 6, 
lines 112-113: Yes, cost concerns may in principle put a 
bound on how much energy an organism is willing to 
spend on accuracy of replication. But I wonder to what 
extent this plays a role in practice. For example, evolu- 
tion has led E. coli to evolve its replication error down 
to something like 2 * 10(-10) errors per base, ie. 
about 0.001 an error per entire genome replication so 
that only one in every 1000 divisions leads to ANY 
error made in the copying. If replication accuracy is as 
costly as the authors seem to suggest, what selection 
pressures do the authors imagine could have led E. coli 
to make its error rate so low (keeping in mind that a 
large fraction of mutations are likely selectively neu- 
tral)? It seems that either we have to assume that even 
one point mutation in 1000 genome replications is still 
very costly from the point of view of fitness (which I 
find hard to imagine), or we must conclude that the 
cost of accuracy in replication is apparently not really 
that large in realistic settings. 

Response: E. coli or any extant cellular life form is a 
product of > 3 billion years of evolution during which the 
error rates have been optimized through the evolution of 
diverse repair systems, under the pressure of different 
evolutionary factors, one of the most important being the 
continuous parasite pressure. That said, the high fidelity 
of replication does not at all preclude evolution extensive 
gene loss via recombination that results, in particular, in 
the evolution of parasites with minimal genomes [67], 
e.g. intracellular bacteria, such as, for instance, Buch- 
nerd, a Close relative of E. coli [68]. 
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page 7, lines 141-147: One loophole I can see here is 
that it seems to be assumed that the genotype-phenotype 
mapping is continuous in the sense that different func- 
tional RES solutions not only exist, but are reachable 
through point mutations. It is conceivable that many RES 
solutions exist, but that they are like ‘single spikes’ in 
genotype space so that it is virtually impossible to mutate 
from one functional to another functional RES. 

Response: We believe that there is a typo in this com- 
ment, i.e. the reviewer means RRS not RES. Under this 
understanding, we indeed assume smooth genotype- 
phenotype mapping whereby small changes (not necessar- 
ily point mutations) can have non-negligible effect on the 
efficiency of the RRS. To the best of our understanding, 
this assumption is realistic given the relative simplicity 
and high plasticity of the replication signals. We added a 
comment to this effect in the revised manuscript. 

page 7, line 150: ‘effectively guarantees that it evolves 
into a higher entropy state’ This is just wrong. We are in 
a non-equilibrium situation where replication is itself 
driven by energy dissipation and the second law gives no 
guarantee whatsoever about which way the evolutionary 
process will go in this situation. 

Response: This is a relevant comment, and in response, 
we added the following to the revised text: “An additional 
important and highly plausible assumption predicating this 
conclusion is that the entire replicator system is stable on 
the time scale of RRS evolution, and accordingly, the evolv- 
ing system can be considered to be at quasi-equilibrium.” 

page 7, lines 153-154: ‘survival of the flattest route 
[50]’. I cannot help myself from mentioning here that 
this work largely revisits theoretical results that were 
first published in [69]. I can see another possible loop- 
hole here (apart from the appeal to thermodynamics 
which, again, I am convinced is mistaken). I think it 
would be possible to have an evolutionarily stable system 
in which there is co-existence of a whole ’quasispecies’ 
of replicators and parasites, which includes SOME re- 
plicators that are parasite-free, and some that are not. 
That is, while there is no benefit for a defense system in 
the absence of parasites, there IS when there are para- 
sites around and I believe one could have a stable 
quasispecies-like state in which there are stable ‘wild 
types’ with defense systems that protect them from para- 
sites, that co-existence with ‘mutant’ forms in which the 
defense system works imperfectly and that ARE infected 
by parasites, which are maintained by exploiting these 
mutants. Thus, I believe one could have an evolutionar- 
ily stable system in which SOME of the organisms are 
perfectly defended against parasites. 

Response: These are interesting possibilities but the 
system described by the reviewer is highly complex, 
and as far as we can see, there is no evidence of its 
plausibility. 
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page 9, lines 221-222: “is thermodynamically unstable” 
No. The thermodynamic arguments do not apply to the 
non-equilibrium situation of the replicators. 

Response: As discussed above, the situation can be con- 
sidered quasi-equilibrium, and the thermodynamic con- 
siderations apply. Nevertheless, we toned it down 
somewhat by replacing “thermodynamic instability” with 
the less restrictive “evolutionarily instability” in several 
places including the title and the abstract. 
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